Search results for "Sequence alignment"
showing 10 items of 32 documents
EvalMSA: A Program to Evaluate Multiple Sequence Alignments and Detect Outliers
2016
8 pages, 3 figures, 2 tables.
OFIP/KIAA0753 forms a complex with OFD1 and FOR20 at pericentriolar satellites and centrosomes and is mutated in one individual with oral-facial-digi…
2016
Oral-facial-digital (OFD) syndromes are rare heterogeneous disorders characterized by the association of abnormalities of the face, the oral cavity and the extremities, some due to mutations in proteins of the transition zone of the primary cilia or the closely associated distal end of centrioles. These two structures are essential for the formation of functional cilia, and for signaling events during development. We report here causal compound heterozygous mutations of KIAA0753/OFIP in a patient with an OFD VI syndrome. We show that the KIAA0753/OFIP protein, whose sequence is conserved in ciliated species, associates with centrosome/centriole and pericentrio…
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
2016
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record, Jorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558, is available online at: https://doi.org/10.1093/bioinformatics/btw558. [Abstract] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters
2016
Computing alignments between two or more sequences is a common operation in computational molecular biology. The continuing growth of biological sequence databases establishes the need for efficient parallel implementations on modern accelerators. This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data par…
MAGA: A Supervised Method to Detect Motifs From Annotated Groups in Alignments
2020
Multiple sequence alignments are usually phylogenetically driven. They are studied in the framework of evolution. But sometimes, it is interesting to study residue conservation at positions unconstrained by evolutionary rules. We present a supervised method to access a layer of information difficult to appreciate visually when many protein sequences are aligned. This new tool (MAGA; http://cbdm-01.zdv.uni-mainz.de/~munoz/maga/ ) locates positions in multiple sequence alignments differentially conserved in manually defined groups of sequences.
NOseq: amplicon sequencing evaluation method for RNA m6A sites after chemical deamination
2020
Methods for the detection of m6A by RNA-Seq technologies are increasingly sought after. We here present NOseq, a method to detect m6A residues in defined amplicons by virtue of their resistance to chemical deamination, effected by nitrous acid. Partial deamination in NOseq affects all exocyclic amino groups present in nucleobases and thus also changes sequence information. The method uses a mapping algorithm specifically adapted to the sequence degeneration caused by deamination events. Thus, m6A sites with partial modification levels of ∼50% were detected in defined amplicons, and this threshold can be lowered to ∼10% by combination with m6A immunoprecipitation. NOseq faithfully d…
Textual data compression in computational biology: Algorithmic techniques
2012
In a recent review [R. Giancarlo, D. Scaturro, F. Utro, Textual data compression in computational biology: a synopsis, Bioinformatics 25 (2009) 1575–1586] the first systematic organization and presentation of the impact of textual data compression for the analysis of biological data has been given. Its main focus was on a systematic presentation of the key areas of bioinformatics and computational biology where compression has been used together with a technical presentation of how well-known notions from information theory have been adapted to successfully work on biological data. Rather surprisingly, the use of data compression is pervasive in computational biology. Starting from…
Type I keratin cDNAs from the rainbow trout: independent radiation of keratins in fish
2002
Five different type I keratins from a teleost fish, the rainbow trout Oncorhynchus mykiss, have been sequenced by cDNA cloning and identified at the protein level by peptide mass mapping using MALDI-MS. This showed that the entire range of type I keratins detected biochemically in this fish has now been sequenced. Three of the keratins are expressed in the epidermis (subtype Ie), whereas the other two occur in simple epithelia and mesenchymal cells (subtype Is). Among the Is keratins is an ortholog of human K18; the second Is polypeptide is clearly distinct from K18. We raised a new monoclonal antibody (F1F2, subclass IgG1) that specifically recognizes trout Is keratins, with negative react…
Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment.
2007
Background: Similarity of sequences is a key mathematical notion for Classification and Phylogenetic studies in Biology. It is currently primarily handled using alignments. However, the alignment methods seem inadequate for post-genomic studies since they do not scale well with data set size and they seem to be confined only to genomic and proteomic sequences. Therefore, alignment-free similarity measures are actively pursued. Among those, USM (Universal Similarity Metric) has gained prominence. It is based on the deep theory of Kolmogorov Complexity and universality is its most novel striking feature. Since it can only be approximated via data compression, USM is a methodology rath…
Changes in membrane lipid composition in ethanol- and acid-adapted Oenococcus oeni cells: characterization of the cfa gene by heterologous complement…
2008
Cyclopropane fatty acid (CFA) synthesis was investigated in Oenococcus oeni. The data obtained demonstrated that acid-grown cells or cells harvested in the stationary growth phase showed changes in fatty acid composition similar to those of ethanol-grown cells. An increase of the CFA content and a decrease of the oleic acid content were observed. The biosynthesis of CFAs from unsaturated fatty acid phospholipids is catalysed by CFA synthases. Quantitative real-time-PCR experiments were performed on the cfa gene of O. oeni, which encodes a putative CFA synthase. The level of cfa transcripts increased when cells were harvested in stationary phase and when cells were gr…